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I A NOVEL GENE FOR CONTROLLING LEAP SHAPES 
BACKGROUND OF THE INVENTION 

1. FIELD OF THE INVENTIONS 

The present Invention relates to a novel gene. In 
5 particular, the present invention relates to a novel gene 
in plants which encodes a protein having the function of 
controlling leaf shapes. 

.» 

2. DESCRIPTION OF THE RELATED ART? 

10 

Transposons are mutagenic genes which are known to 
be ubiquitous in animal, yeast, bacterial, and plant genomes. 
Transposons are classified into two classes. Class I and 
Class II, depending on their transposition mechanisms. 

15 Transposons belonging to Class II are transposed in the form 
of DNAs without toeing replicated . Known Class II 
transposons Include the Ao/Ds, Spm/dSpm and Mu elements of 
Zea mays (Fedoroff f 1989, Cell 56, 181-191* Fedorof f et al. , 
1983, Cell 35, 235-242; Schlefelbein et al., 1985, Proc, 

20 Natl. Acad* Sci. USA 82, 4783-4787), and the Tam element 
of Antirrhinum majus (Bonas et al., 1984, EMBO J., 3, 
1015-1019). Class II transposons are widely used for gene 
isolation techniques which utilize transpoeon tagging. 
Such techniques utilize the fact that a transposon induces 

25 physiological and morphological changes when inserted into 
genes. The affected gene can be isolated by detecting such 
changes (Bancroft et al., 1993, The Plant Cell, 5, 631- 
638; Colasanti et al., 1998, Cell, 93, 593-603; Gray et al., 
1997, Cell, 89, 25-31; Keddie et al., 1998, The Plant Cell, 

30 10, 877-887; Whitham et al., 1994, Cell, 78, 1101-1115). 

Transposons belonging to Class I, also referred to 
as retrotransposons, are replicated and transposed via RNA 
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intermediates. Class I transposons vere first identified 
and characterized in Drosophila and in yeasts. However, 
recent studies have revealed that Class I transposons are 
ubiquitous in plant genomes and account for a substantial 
portion of the genomes (Bennetzen, 1996, Trends Microbiolo . , 
4, 347-353; Voytas, 1996, Science, 274, 737-738). A large 
majority of retrotransposons appear to be inactive. Recent 
studies indicate that some of these retrotransposons are 
activated under stress conditions such as injuries, 
pathogenic attacks, or cell culture (Grandbastlen, 1998, 
Trends in Plant Science, 3, 181-187; Wessler, 1996, Curr. 
Biol. 6, 959-961; Wessler et al. , 1995, Curr. Opin. Genet. 
Devel, 5 , 814-821) » Activation under stress conditions has 
been reported for TntlA and Ttol in tobacco (Pouteau et al. , 
1994, Plant J., 5, 535-542; Takeda et al., 1988, Plant Mol. 
Biol., 36, 365-376), and Tosl7 in rice (Hirochika et al., 
1996, Proo. Natl. Acad. Sci. USA, 93, 7783-7788), for 
example „ 

The Tosl7 retrotransposon of rice is one of the 
most -extensively studied plant Class I elements in plants. 
Tosl7 was cloned by an RT-PCR method using a degenerate 
primer prepared based on a conservative amino acid sequence 
in reverse transcription enzyme domains between Tyl-copia 
retroelements (Hirochika et al., 1992, Mol. Gen. Genet., 
233, 209-216). Tosl7 is 4.3kblong, and has two 138 bp LTRs 
(long chain terminal repetitions) and PBS (primer binding 
sites) complementary to the 3 f end of the start methionine 
tRNA (Hirochika et al., 1996, supra). Tosl7 transcrlpti n 
is strongly activated through tissue culture, and its copy 
number increases with culture time. In Nipponbare. a model 
Japonica cultivar used for genome analysis, two copies of 
Toel7 are initially present, which are Increased to 5 to 
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30 copies in a regenerated plant after tissue culture 
(Hirochika et al., 1996 , supra). Unlike Class II 
transposons which were characterized in yeasts and 
Droeophila, Tosl7 is transposed in chromosomes in random 
manners and causes stable mutation, and therefore provides 
a powerful tool for functional analysis of rice genes 
(Hirochika, 1997. Plant Mol. Biol. 35, 231-240; 1999, 
Molecular Biology of Rice (ed. by K. Shimamoto, 
Springer -Verlag, 43-58) • 

SUMMARY OP THE INVENTION 

The present invention relates to a polynucleotide 
encoding a plant gene capable of controlling leaf shapes, 
the polynucleotide encoding an amino acid sequence from M t 
at position 1 to Val at position 690 of SEQ ID NO: 2 in the 
SEQUENCE LISTING, including any polynucleotide encoding an 
amino acid sequence in which one or more amino acids are 
deleted, substituted or added to the amino acid sequence. 

In one embodiment of the invention, the 
polynucleotide may be derived from rice. 

In another embodiment of the invention, the 
polynucleotide may be as represented by SEQ ID NO: 1 in the 
SEQUENCE LISTING* 

The present invention further relates to methods for 
controlling leaf shapes in plants. 

The inventors diligently conducted systematic 
analyses of pheno types of plants having a newly transposed 
To 17 copy and sequences adjoining Tosl7 target sites with 
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respect to rice. As a result, the inventors found a 
narrow-leaf rice mutation obtained from Toel7 insertion, 
and isolated the gene responsible for this mutation by 
utilizing Tosl7 as a tag, thereby accomplishing the present 
invention. 

Thus, the invention described herein makes possibl 
the advantage of: providing a novel plant gene which can 
be provided by using Tosl7. 

This and other advantages of the present invention 
will become apparent to those skilled in the art upon reading 
and understanding the following detailed description with 
reference to the accompanying figures . 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a photograph showing a Tosl7-inserted 
narrow- leaf mutant rice plant (left) and a wild- type rice 
plant (right) * 

Figure 2 shows a Southern analysis autoradiogram of 
DNA extracted from self -crossed progeny from a narrow-leaf 
mutant NC0608 strain (R2 generation) and DNA extracted from 
a wild- type rice. On the left is shown a autoradiogram of- 
a Southern analysis performed by using Tosl7 as a probe, 
on the right is shown an autoradiogram of a Southern analysis 
performed by suboloning NC0608_0_102, which is one of the 
adjoining sequences of Tosl7, and using it as a probe. The 
lane indicated as M is a lane of a A/Hindlll marker* The 
lane indicated as C is a control lane in which DNA obtained 
from a wild- type plant (Nipponbare) was elect rophoresed. 
The lane indicated as mt is a lane in which DNA obtained 
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from a narrow- leaf mutant was elect rophoresed. 

Figure 3 is* a schematic representation of a gene 
which control leaf shapes. Blank boxes in the figure 
represent introns, whereas black boxes represent exons . 
The downward arrow on the right-hand side of the figure 
represent a position at which Tosl7 was inserted. The two 
small downward arrows near the 5 1 end and the 3 1 end represent 
a start codon site and a stop codon site, respectively. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention provides a novel plant gene 
which can be provided by using Tosl7, a vector containing 
the same, a plant which is transformed by the novel gene, 
and a method of producing an improved plant including a st p 
of transforming a plant with the novel gene. 

According to the present invention, there is 
provided a polynucleotide encoding a plant gene capable of 
controlling leaf shapes. As used herein, the term 
w controlling leaf shapes" means the ability to alter the 
leaf length and/ or leaf width of a plant , thereby enhancing 
photosynthesis ability or imparting resistance against 
lodging, etc The term "plants" encompasses both 
monocotyledons and dicotyledons . 

A polynucleotide encoding a plant gene capable of 
controlling leaf shapes aaaording to the present invention 
is, for example, a polynucleotide encoding an amino acid 
sequence from Met at position 1 to Val at position 690 of 
SEQ ID NO: 2 in the SEQUENCE LISTING, including any 
polynucleotide encoding an amino acid sequence in which one 
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or more amino acids are deleted, substituted or added to 
the aforementioned amino acid sequence. 

A polynucleotide encoding a plant gene capable of 
5 controlling leaf shapes encompasses any polynucleotides 
which have at least about 80% sequence homology , preferably 
at least about 85% sequence homology/ and more preferably 
at least about 90% sequence homology, still more preferably 
at least about 95% sequence homology, and most preferably 

10 at least about 99% sequence homology, with an amino acid 
sequence from Met at position 1 to Val at position 690 of 
SEQ ID NO: 2 in the SEQUENCE LISTING, so long as they are 
capable of controlling leaf shapes in plants • The term 
"sequence homology* indicates a degree of identicalness 

15 between two polynucleotide sequences to be compared with 
each other* The rate (%) of sequence homology between two 
polynucleotide sequences for comparison is calculated by, 
after optimally aligning the two polynucleotide sequences 
for comparison, obtaining a matched position number 

20 indicating the number of positions at which identical, or 
"matched* , nucleic acid bases (e.g., A, T, C,G, U, or I) 
are present in both sequences , dividing the matched position 
number by total number of bases In the polynucleotide 
sequences for comparison, and multiplying the quotient by 

25 100. The sequence homology can be calculated by using the 
following sequencing tools, for example: a Unix base program 
designated GCG Wisconsin Package (Program Manual for the 
Wisconsin Package, Version 8, September 1994, Genetics 
Computer Group, 575 Science Drive Madison, Wisconsin, USA 

30 53711? Rice, P. (1996) Program Manual for EGCG Package , Peter 
Rice, The Sanger Centre, Hinxton Hall, Cambridge, CB10 IRQ, 
England) , and the ExPASy World Wide Web molecular biology 
server ( Geneva University Hospital and University of Geneva , 
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Geneva, Switzerland) * 

The -term w control sequence 9 ae used herein refers 
to a DNA sequence Including a functional promoter and any 
related transcription elements (e.g., an enhancer, CCAAT 
box, TATA box, SPI site, etc.). 

The term "operably linked" as used herein refers to 
a manner of linking a polynucleotide such that various 
regulation elements such as a promoter, enhancer , etc., 
which regulate its expression can operate within a host cell . 

It is well-known to those skilled in the art that 
the type and kinds of control sequences may vary depending 
on the host cell. For example, CaMV35S promoter, nopaline 
synthase promoter, and the like are well-known to those 
skilled in the art. Any methods that are known to those 
Skilled in the art may be used for introducing the gene into 
a plant body. For example, methods which utilize 
agrobaoterium and methods which directly introduce a gene 
in a cell are well known. As for methods which utilize 
agrobacterium, the method of Nagel et al. (Microbiol. Lett. 
67, 325 (1990)) may be used, for example. This method 
involves first transforming agrobacterium with an 
expression vector via electroporation , and then introducing 
the transformed agrobacterium into a plant cell by following 
a method described in Plant Molecular Biology Manual (S.B. 
Gelvin et al. , Academic Press Publishers) . Electroporation 
techniques and partile gun techniques are known as methods 
for directly introducing a gene into a cell. 

Cells into which genes have been introduced are 
first selected based on drug resistance, e.g., hygromyoin 
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resistance, and then regenerated into plant bodies by using 
usual methods- 

The terminology and laboratory procedures describ d 
5 throughout the present specification are directed to those 
which are well-known and commonly employed in the art . 
Standard techniques may be used for recombination methods, 
polynucleotide synthesis, microorganisms culturlng, and 
transformation (e.g., electroporation) . Such techniques 

10 and procedures are generally known from various standard 
textbooks available in the field or by way of the present 
specification (including a generally-referenced textbook 
by Sambrook et al. , Molecular Cloning: A Laboratory Manual, 
2nd edition ( 1989) Cold Spring Harbor Laboratory Press, Cold 

15 Spring Harbor , N.Y.). Such literature is incorporated 
herein by reference. 

The polynucleotide according to the present 
invention can be obtained by using the method described 

20 herein, for example. However, the polynucleotide according 
to the present invention may also be obtained by any chemical 
synthesis process based on the sequence disclosed her in. 
For example, the polynucleotide according to the present 
invention may be synthesized by using a polynucleotide 

25 synthesizer available from Applied Bio Systems in accordance 
with the instructions provided by the manufacturer. 

Methods of PCR amplification are well-known in the 
art (PCR Technology: Principles and Applications for DNA 
30 Amplif ication, ed. HA Erlioh, Freeman Press, NewYork, NY 
(1992) ; PCR Protocols: A Guide to Methods and Applications , 
Innis, Gelfland, Snlsky, and White, Academic Press, San 
Diego, CA(1990); Mattila et al. (1991) Nucleic Acids REs. 
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195 4967; Eckert, K.A. and Kunkel, T. A. (1991) PCR Methods 
ana Applicant axons It 17 j PCR, Mcpherson, Quirkes, ana Taylor, 
1RL Press, Oxford) . Such literature is incorporated herein 
by reference, 

5 

(Examples) 

Hereinafter, the present invention will be 
described by way of examples which are of illustrative but 
not limitative nature/ 

10 

(Example 1: Activation of Tosl7 via culture 

Using fully ripened seeds of Nipponbare, which is 
a variety of Japonica subspecies, induction of calli and 
cell suspension culture were carried out as described 

15 earlier (Hirochika et al. , 1996, supra) . The activation of 
Tosl7 was carried out following the method of Ohtsuki (1990) 
(rice protoplast culture system. Food and Agricultural 
Research Development Association). In summary, fully 
ripened seeds of rice were cultured in an MS medium having 

20 2,4-dlchlorophenoxyacetic acid (2,4-D) added thereto 
(2 mg/ml) (Ohtsuki (1990), supra) (25*0, 1 month), to Induce 
callus formation. The resultant calluses were cultured for 
5 months in an N6 liquid medium having 2,4-D added thereto 
(Ohtsuki (1990), supra), and thereafter placed on a 

25 rediff erentiatlon medium (Ohtsuki (1990), supra), wher by 
redif f erentiated rice plants were obtained (first 
generation (Rl) plants). 

(Example 2i Isolation and Identification of narrow- leaf 
30 mutants 

Utilizing each of the regenerated Rl rice plants 
obtained according to Example 1, about 1000 Rl seeds were 
collected from each strain and sown on a paddy field to obtain 
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second generation (R2) plants , which were subjected to a 
morphological analysis. As a result of observing the 
phenotypee of the respective plant bodies in the R2 group, 
it was learned that about 1/4 of the R2 group of the NC0608 
strain exhibit the M narrow- leaf * phenotype (Figure 1) . In 
the paddy field, the Tosl7-inserted narrow-leaf mutants had 
their leaf length reduced to about 90% in the flag leaf and 
all leaves down to the third leaf therefrom? and they also 
had their leaf width reduced to about 78%, about 70% , about 
71%, about 69% , respectively, in the flag leaf and all leav s 
down to the third leaf therefrom (Figure 1, left), as 
compared with the wild type (Figure 1, right). This 
suggested that the narrow-leaf phenotype of NC0608 1© caused 
by recessive mutation at a single gene locus. 

(Example 3; Isolation of causative gene for narrow-leaf 
mutations ) 

In order to Identify and isolate the causative gene 
for narrow- leaf mutations from the NC060B strain obtained 
according to Example 2, linkage analysis with respect to 
the Tosl7 gene was performed on a group part of which was 
separable as narrow-leaf mutations. In order to show that 
recessive mutation at a single gene locus is responsible 
for the mutations, adjoining portions of a target site (Ts) 
of the NC0608 strain at which Tosl7 had been transpos - 
inserted were amplified first. 

From the group of R2 rice plants (self -cross d 
progeny from the NC0608 strain) obtained according to 
Example 2, Individuals exhibiting mutation were identified 
from normal individuals • DNA was prepared from both groups 
of individuals by using a CTAB method (Murray and Thompson, 
1980, Nucleic Acids Res. 8, 4321-4325). The DNA obtained 
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from individuals exhibiting narrow- leaf mutation and the 
DNA obtained from normal individuals were each digested with 
restriction enzyme Xbal , and after agarose electrophoresis , 
were allowed to adsorb to nylon membranes, DNA fragments 
which were obtained from Toel7 through digestion by Xbal 
and BamHI were labeled with 3a P-dCTP* By using these as 
probes, a Southern hybridization was performed (Figure 2, 
left). As seen from the Southern analysis autoradiogram 
shown on the left-hand side in Figure 2 # it was learned the 
Tosl7 band (about 6600 bp) indicated by an arrow was observed 
in narrow-leaf mutations as a homozygous band, but not in 
normal individuals , and that the Tosl7 band indicated by 
the arrow was completely linked with the narrow-leaf 
mutation phenotype. From these results, it was conclud d 
that the DNA which is represented by the band which 
hybridizes to the Tosl7 probe indicated by the arrow 
contains a causative gene, such that Tosl7, when inserted 
in a genome region represented by this band, generates 
narrow- leaf mutations as the genotype becomes homozygous. 
Accordingly/ a portion of the causative gene for the 
narrow-leaf mutations, i.e., a sequence adjoining Tosl7, 
was isolated through TAIL-PCR reactions using this DNA as 
a template. The amplification of the Tosl7 target site 
sequence was accomplished by TAIL-PCR employing the total 
DNA (Liu Y-G. et al. , 1995, Genomics, 25, 674-681, Liu Y-G. 
et al., 1995, Plant J . , 8, 457-463). In summary, by using 
as a template the total DNA from a regenerated plant having 
a new Tosl7 target site, three TAIL-PCR amplification 
reactions were performed, using the following three sets 
of primers: (1st reaction) Toel7 Tall3, 

GAGAGCATCATCGGTTACATCTTCTC and AD1 ( arbitrarily 
degenerated primer 1) NGTCGA (G/C) (A/T) GANA <A/T) GAA; 
(2nd reaction) Tosl7 Tail4, ATCCACCTTGAGTTTGAAGGG and AD1; 
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and (3rd reaction) Tosl7 Tails, CATCGGATGTCCAGTCCATTG and 
AD1. Next, the respective TAIL-PCR products were subjected 
to an agarose electrophoresis and then a simple column 
purification. By directly applying them to a sequencer 
5 (Model 377 available from ABI), sequencing was performed. 

Four new target sites (Ts) for Tosl7 insertion were 
identified as a result of sequencing the adjoining sequences 
of Tosi7 in the NC0608 strain. 

10 

Next, a Southern analysis was performed by 
subcloning NC0608_0_ 102, one of the adjoining sequences of 
Tosl7, and using it as a probe. The results are shown on 
the right-hand side in Figure 2. As seen from the 

15 autoradlogram on the right-hand side in Figure 2, the 
Tos 17 -adjoining sequence NC0608_0_102 hybridized to the DNA 
fragment located at the same position as that indicated in 
the Southern analysis in which Tosl7 was used as a probe. 
The results were consistent for all of the 62 strains that 

20 were examined. This indicates that the subclone 
NC060B_0_102 contains a portion of the causative gene for 
the narrow-leaf mutation, and that NC0608_0_102 is an 
adjoining sequence of the causative gene for the narrow-leaf 
mutation. 

25 

(Example 4s Structural analysis of the causative gene for. 
narrow -leaf mutation) 

Relying on the adjoining sequence obtained 
according to Example 3 , the inventors attempted to determine 
30 the complete structure of cDNA which was transcribed from 
the gene containing the adjoining sequence NC0608_0_102 
through a PGR screening using a cDNA library and Cap Site 
cDNA (Nlppongene) . By using the wild- type (Nipponbare) DNA 
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as a template, the inventors attempted to determine the 
complete structure of the genomic DNA of the gene containing 
NC060-8_0_102 through a PCR using a primer which is design d 
from the cDNA and through the aforementioned TAIL-PCR. 

The cDNA library was previously prepared in the 
laboratory of the inventors . The method of preparation can 
be summarized as follows . First , by using an ISOGEN solution 
{ Nippongene ) , the total RNA was extracted from a callus of 
a wild-type rice plant which had been oultured in the 
aforementioned MS medium. By using an oligo(dT) cellulose 
column contained in an mRNA purification kit (Stratagen ) , 
poly(A)xnRNA was obtained from the total RNA. Following 
usual methods, cDNA was synthesized from the resultant 
poly (A) mRNA . Thus , a cDNA library was constructed in a Hybri 
ZAP-II vector (Stratagene) * 

The cDNA and genomic DNA of the gene containing the 
adjoining sequence NC0608_0_ 102 were partially amplified 
through the below- described four- step PCR reactions and 
three- step PCR reactions , respectively. All of the 
amplified fragments were sequenced by using a 377 seguenc r 
(Perkin Elmer) for both directions. 

(cDNA) 

First step i Using cDNA library as a template, a PCR 
reaation was carried out by using a pair of primers specif ia 
to the adjoining sequence NC0608_0_102 to confirm that a 
portion of this adjoining sequence is contained in the cDNA 
library : NC 0 6 0 8_0_1 0 2F ACGGAGACACCTCGT AAACC and 
NC0608 0 102R1 AAGGCCGACTATTGTTGACC . 



14 - 



AAUZX 



Second step: Using the cDNA library as a template , 
a PCR reaction was carried out by using NC0608_0_102F and 
Hybri ZAP B (Stratagene) , which is a primer specific to Hybri 
ZAP-II vector- Thus , a fragment which partially overlaps 
with NC0608_o_l02 and which contains the 3 f region of cDNA 
along with the poly(A) binding site was obtained. 

Third step: Using the cDNA library as a templat , 
a PCR reaction was carried out by using Hybri ZAP A 
(Stratagene), which is a primer specific to Hybri ZAP-II 
vector , and NC0608_0_102R2 CCTGCAATGTTACCTCTGGC , which is 
a primer specific to NC0608_0_102 . Thus . a 5 1 fragment which 
partially overlaps with NC0608_0_102 was obtained* 

Fourth step: Using Cap Site cDNA (Nlpponegne) as a 
template , a PCR reaction was carried out by using 1RC2 
(Nippongene) , which is a primer specific to Cap Site, and 
TGACAGGTCAGACTGATCAACCGG, which is a primer specific to the 
fragment obtained in the third step- Thus, a fragment which 
partially overlaps with the fragment obtained in the third 
step and which contains the 5 * region of cDNA along with 
the transcription start point (cap site) • 



(Genomic DNA) 

First step: Using the total DNA of Nipponbare, two 
reactions of TAIL -PCR were carried out using the following 
two sets of primers to obtain a 5 ■ fragment which partially 
overlaps with the NC0608_0_102: (first reactions 
NC0608_0_102R2 and AD1 employed in Example 3; second 
reaction: NC0608_0_102R3 TAGGCAATCCGGCAATGTCC and AD1) 

Second step: Using the total DNA of Nipponbare, a 
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PGR reaction was carried out using a primer 
(CTAGAAGCAAAATCTTGAAGCTGC) which is specific to the 
fragment obtained in the first step and a primer 
(AGTGTTCTTCGCACCTCGCG) which is specific to the cDNA 
fragment obtained in the fourth step PCR. Thus, a 5* 
fragment which partially overlaps with the fragment obtained 
in the first step was obtained. 

Third step: Using the total DNA of Nipponbare, a PCR 
reaction was carried out using a prim r 
(TGCCTCGCCCTCGGCGATGG) which is specific to the fragment 
obtained in the second step and a primer 
(AATATTTCAAATCACACTAC) which is specific to the 5* region 
of the cDNA fragment obtained in the fourth step PCR. Thus, 
a 5 1 fragment which partially overlaps with the fragm nt 
obtained in the seoond step was obtained. 

The cDNA and genomic DNA structures of the 
narrow- leaf gene are shown together in Figure 3. This gene 
has 11 introns and encodes 690 amino acids, and yet finds 
no similar genes registered in existing databases. Thus, 
it was confirmed that this gene Is novel. It was learned 
that Tosl7 had been inserted between the 9th and the 10th 
bases from the 5* end of the 12th exon region. An amino acid 
sequence encoded by this gene showed very high homology with 
a gene in Arab idop sis thaliana having an unknown function. 

The above examples are Illustrative, and by no means 
limitative, of various aspects of the present invention and 
the manners in which the oligonucleotide according to the 
present invention can be made and utilized. 

Thus, according to the present invention, a n vel 
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polynuoleotide Is provided which is capable of controlling 
leaf shapes, the polynucleotide being of use in plant 
breeding- By introducing the present polynucleotide into 
plants and artificially controlling leaf shapes, it is 
5 expected that enhancement of photosynthesis ability or 
provision of resistance against lodging, etc., can be 
attained. 

Various other modifications will be apparent to and 
10 can be readily made by those sXilled in the art without 
departing from the scope and spirit of this invention* 
Accordingly, it is not intended that the scope of the claims 
appended hereto be limited to the description as set forth 
herein, but rather that the claims be broadly construed. 
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SEQUENCE LISTING 

<110> National Institute of Agrobiological Resources, Ministry of Agriculture, 
forestry and Fisheries 

Bio-Oriented Technology Research Advancement Institution Society for 
Techno-Innovation of Agriculture. Forestry, and Fisheries 

<120> A novel gene which controls leaf shape and size 

<130> 

<140> JP 

<141> 2000-03-23 
<160> 3 

<170> Patent in Ver. 2.1 

<210> 1 
<211> 2463 
<212> DNA 

<213> Oryza sativa 

<220> 
<221> CDS 

<222> (198).. (2270) 

<300> 

<400> 1 

aaaaaa&tat ttcaaatcac actacactct ccgtcgtctc ctctcctctc ctctcctccc 60 



cctctcctcc gcctctctcg catctgaggc tccgatcgcc ggcgacccca gccagaatcc 120 
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gccgccccgt ctcgccctcc ccgctcgacg agaccgcgcc gagcggcgaa gaggcctagt 180 

gttctlcgca cctcgcg atg agt age gcg gig aag gac cag ctt cac cag 230 
Met Ser Ser Ala Val Lys Asp Gin Leu His Gin 
1 S 10 

atg teg acg aca tgc gat teg ctt eta ctg gag etc aat gtg att tgg 278 
Met Ser Thr Thr Cys Asp Ser Leu leu Leu Glu Leu Asa Val lie Trp 
15 20 25 

gat gag gtc ggt gag ccc gac aeg acg agg gac agg atg ctg ctg gag 326 
Asp Gin Yal Gly Glu Pro Asp Thr Thr Arg Asp Arg Met Leu Leu Glu 
30 36 40 

etc gag cag gag tgc ctg gag gtc tac agg egg aag gtc gac cag gcg 374 
Leu Glu Gin Glu Cys Leu Glu Val Tyr Arg Arg Lys Val Asp Gin Ala 
45 50 55 

aac egg age cgc gee cag ctg egg aag gec ate gee gag ggc gag gca 422 
Asu Arg Ser Arg Ala Gin Leu Arg Lys Ala He Ala Glu Gly Glu Ala 
60 65 70 75 

gag etc gee ggc ate tgc tea gec atg ggc gag ccg ccc gtg cac gtt 470 
Glu Leu Ala Gly lie Cys Ser Ala Met Gly Glu Pro Pro Val His Val 
80 85 90 

aga cag tea aat cag aag ctt eat ggc tta aga gag gag ttg aat gca 518 
Arg Gin Ser Asn Gin Lys Leu His Gly Leu Arg Glu Glu Leu Asn Ala 
95 100 105 



att gtt ccg tat ttg gaa gaa atg aaa aag aaa aag gtc gaa cga tgg 566 
He Val Pro Tyr Leu Glu Glu Met Lys Lys Lys Lys Val Glu Arg Trp 
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110. 



115 



120 



aac cag tti git cat gtc ata gag cag att aag aaa att teg let gaa 
Asn Gin Phe Val His Val lie Glu Gin He Lys Lys lie Ser Ser Glu 
125 130 13S 



ata agg cca gec gat ttt gtt ccc ttt aaa gtt ccg gtt gat cag tct 
lie Arg Pro Ala Asp Phe Yal Pro Phe Lys Yal Pro Yal Asp Glu Ser 
140 145 150 155 

gac ctg tea tta aga aag ctt gat gag ttg acg aag gac ctg gna tec 
Asp Leu Ser Leu Arg Lys Leu Asp Glu Leu Thr Lys Asp Leu Xaa Ser 
160 165 170 



ctt cag aag gag aag age gat egg eta aag caa gtg ata gaa cat ttg 
Leu Glu Lys Glu Lys Ser Asp Arg Leu Lys Glu Yal He Glu His Leu 
176 180 185 



aat tct ttg cat tec tta tgt gag gtg ctt ggc ata gat ttc aas caa 
Asn Ser Leu His Ser Leu Cys Glu Yal Leu Gly He Asp Phe Lys Gin 
190 195 200 

aca gta tat gag gtg cac cct age ttg gac gaa get gaa gga tea aag 
Thr Val Tyr Glu Val His Pro Ser Leu Asp Glu Ala Glu Gly Ser Lys 
205 210 215 



aac ctg age aac act aca att gag agg ctt get get gec gca aac aga 
Asu Leu Ser Asn Thr Thr He Glu Arg Leu Ala Ala Ala Ala Asn Arg 
220 225 230 235 



ctg cgt gaa atg aag ate caa agg atg caa aag ctt caa gat ttt get 
Leu Arg Glu Met Lys He Gin Arg Met Gin Lys Leu Gin Asp Phe Ala 
240 245 250 
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tct age atg etc gag cia tgg aat etc atg gat act cca ctt gaa gag 998 
Ser Ser Met Leu Glu Leu Trp Asu Leu Met Asp Tlir Pro Leu Glu Glu 
255 260 265 

cag cag atg ttt cag aat ata aca tgc aat alt get get tea gaa caa 1046 
Gin Gin Met Phe Gin Asn He Thr Cys Asn He Ala Ala Ser Glu Gin 
270 275 280 

gag ata act gaa cca aac acc etc tec aca gat ttc ctg aat tat gtc 1094 
Glu lie Thr Glu Pro Asn Thr Leu Ser Thr Asp Phe Leu Asn Tyr Val 
285 290 295 

gaa tct gag gtg tta agg ctt gaa caa ctg aaa gca agt aag atg aaa 1142 
Glu Ser Glu Val Leu Arg Leu Glu Gin Leu Lys Ala Ser Lys Met Lys 
300 305 310 315 

gat ctt gtt tta aaa aag aaa gca gaa eta gaa gag cat aga aga egt 1190 
Asp Leu Val Leu Lys Lys Lys Ala Glu Leu Glu Glu His Arg Arg Arg 
320 325 330 

get cat ctt gtt ggc gag gaa ggt tat gca gag gag ttt age att gaa 1238 
Ala His Leu Val Gly Glu Glu Gly Tyr Ala Glu Glu Phe Ser lie Glu 
335 340 345 

get att gaa get gga get att gat ccc tea eta gta ctt gaa caa att 1286 
Ala lie Glu Ala Gly Ala He Asp Pro Ser Leu Val Leu Glu Gin He 
350 355 360 

gaa get cac att gca aca gtg aaa gag gaa get. ttt age egg aag gat 1334 
Glu Ala His He Ala Thr Val Lys Glu Glu Ala Phe Ser Arg Lys Asp 
365 370 375 



- 21 - 



aft ctt gag aaa gtt. gaa aga tgg caa aat get tgt gaa gag gaa gec 1382 
He Leu Glu Lys Val Glu Arg Trp Gin Asn Ala Cys Glu Glu Gin Ala 
380 ' 386 390 395 



tgg ctg gaa gat tac aac aaa gat gat aat cgt tac aat get ggg agg 1430 
Trp Leu Glu Asp Tyr Asn Lys Asp Asp Asn Arg Tyr Asn Ala Gly Arg 
400 405 410 

gga gca cat eta aca eta aag agg get gaa aag get cgt act tig gtc 1478 
Gly Ala His Leu Tbr Leu Lys Arg Ala Gin Lys Ala Arg Thr Leu Val 
415 420 425 

aac aag att ect gga atg gta gat gtt ttg aga aca aaa att get gca 1526 
Asn Lys He Pro Gly Met Val Asp Val Len Arg Thr Lys lie Ala Ala 
480 435 440 



tgg aaa aat gaa cga gga aag gag gat ttc aca tat g&t ggt gtt age 1574 
Trp Lys Asn Glu Arg Gly Lys Glu Asp Phe Thr Tyr Asp Gly Val Ser 
445 450 455 

ctt teg tea atg ctt gat gaa tat atg ttc gtt cgt cag gag aaa gag 1622 
Leu Ser Ser Met Leu Asp Glu Tyr Met Phe Val Arg Gin Glu Lys Glu 
460 465 470 476 

caa gag aag aag aga caa agg gat cag aag aag etc cag gat cag etc 1670 
Gin Glu Lys Lys Arg Gin Arg Asp Gin Lys Lys Leu Gin Asp Gin Leu 
480 485 490 



aaa gcg gag cag gaa get ttg tac gga tea aaa ecc agt cca tec aag 1718 
Lys Ala Glu Gin Glu Ala Leu Tyr Gly Ser Lys. Pro Ser Pro Ser Lys 
495 500 505 



ecc eta agt aca aag aag gca cct agg cac tct atg ggt ggt gca aac 1766 
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Pro Leu Ser Thr Lys Lys Ala Pro Arg His Ser Met Gly Gly Ala Asn 
510 £16 520 



cga agg eta tct ctt ggt gga gec acc atg caa ccc ccg aag act gat 1814 
5 Arg Arg Leu Ser Leo Gly Gly Ala Thr Met Gin Pro Pro Lys Thr Asp 
525 580 535 

ata ctg cat tea aag tct gtt cgt get gec aag aaa act gaa gaa ate 1862 
lie Leu His Ser Lys Ser Val Arg Ala Ala Lys Lys Thr Gin Glu He 
10 540 545 550 555 



15 



ggc act ttg tec cct agt agt agt aga ggt ttg gac att gec gga ttg 1910 
Gly Thr Leu Ser Pro Ser Ser Ser Arg Gly Leu Asp lie Ala Gly Leu 
660 665 570 

cct ate aag aag ttg tct tic aat gec agt act eta cgt gag acg gag 1958 
Pro He Lys Lys Leu Ser Phe Asn Ala Ser Thr Leu Arg Glu Thr Glu 
575 580 585 



20 aca cct cgt aaa cct ttt get cag ate aca cca gga aac agt gtc teg 
Thr Pro Arg Lys Pro Phe Ala Gin lie Thr Pro Gly Asn Ser Val Ser 
590 595 600 



2006 



teg acg cct gtg cgc cct ate acc aat aac act gag gat gat gag aac 
25 Ser Thr Pro Yal Arg Pro He Thr Asn Asn Thr Glu Asp Asp Glu Asn 
605 610 615 



2054 



agg act ccg aag aca ttt aca gca ctg aat ccc aag act ccg atg act 2102 
Arg Thr Pro Lys Thr Phe Thr Ala Leu Asn Pro Lys Thr Pro Met Thr 
30 620 625 630. 636 

gtt acg get cca atg cag atg gca atg act ccc tct ctg gee aac aag 2150 
Val Thr Ala Pro Met Gin Met Ala Met Thr Pro Ser Leu Ala Asn Lys 
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640 645 650 

gtt tea gca act cca gtl tec ctt gti tac gac aag cca gag gta aca 2198 
Val Ser Ala Thr Pro Yal Ser Leu Val Tyr Asp Lys Pro Glu Val Thr 
5 655 660 665 

tig cag gag gac ate gac tac tec ttt gaa gaa agg egg etc gee ate 2246 
Leu Gin Glu Asp He Asp Tyr Ser Phe Glu Glu Arg Arg Leu Ala lie 
670 675 ' 680 

10 

tat ctg gee agg caa atg gtt taa etgttgatca atttatgUc glagttgaaa 2300 
Tyr Leu Ala Arg Gin Met Val 
686 690 

15 tetgactgea ttttcttgtc ggtggccatt gcgtatgttg gtcaacaala gtcggccttt 2360 

ccagtagcac tattctgatt tactgeaatt gttttaatgt tttctacaac cagtaaaaca 2420 

gctctataca ttagcttget cactaaaaaa aaaaaaaaaa aaaaaaaa 2468 

20 

<210> 2 

<211> 690 

<212> PRT 

<213> Oryza sativa 

25 

<400> 2 

Met Ser Ser Ala Yal Lys Asp Gin Leu His Gin Met Ser Thr Thr Cys 

1 5 10 16 

Asp Ser Leu Leu Leu Glu Leu Asn Val lie Trp Asp Glu Val Gly Glu 
30 20 25 30 

Pro Asp Thr Thr Arg Asp Arg Met Leu leu Glu Leu Glu Gin Glu Cys 

35 40 45 

Leu Glu Yal Tyr Arg Arg Lys Val Asp Gin Ala Asn Arg Ser Arg Ala 
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50 55 60 

Gin Leu Arg Lys Ala lie Ala Glu Gly Glu Ala Gin Leo Ala Gly lie 
65 70 75 80 

Cys Ser Ala Met Gly Glu Pro Pro Yal His Yal Arg Gin Ser Asn Gin 

85. 90 95 

Lys Leu His Gly Leu Arg Glu Glu Leu Asn Ala lie Yal Pro Tyr Leu 

100 105 110 

Glu Glu Met Lys Lys Lys Lys Val Glu Arg Trp Asn Gin Phe Yal His 

116 120 1 125 

Yal lie. Glu Gin He Lys Lys lie Ser Ser Glu lie Arg Pro Ala Asp 

130 185 140 

Phe Yal Pro Pbe Lys Yal Pro Val Asp Gin Ser Asp Leu Ser Leu Arg 
145 160 156 160 

Lys Leu Asp Glu Leu Thr Lys Asp Leu Xaa Ser Leu Gin Lys Glu Lys 

165 170 175 

Ser Asp Arg Leu Lys Gin Yal lie Glu His Leu Asn Ser Leu His Ser 

180 185 190 

Leu Cys Glu Val Leu Gly He Asp Phe Lys Gin Thr Yal Tyr Glu Yal 

195 200 205 

His Pro Ser Leu Asp Glu Ala Glu Gly Ser Lys Asn Leu Ser Asn Thr 

210 215 220 

Thr lie Glu Arg Leu Ala Ala Ala Ala Asn Arg Leu Arg Glu Met Lys 
225 230 235 240 

He Gin Arg Met Gin Lys Leu Gin Asp Phe Ala Ser Ser Met Leu Glu 

245 250 255 

Leu Trp Asn Leu Met Asp Thr Pro Leu Glu Gin Gin Gin Met Phe Gin 

260 265 270 

Asn lie Thr Cys Asn lie Ala Ala Ser Glu Gin Glu He Thr Glu Pro 

275 280 285 

Asn Thr Leu Ser Thr Asp Phe Leu Asn Tyr Yal Glu Ser Glu Yal Leu 

290 295 300 

Arg Leu Glu Gin Leu Lys Ala Ser Lys Mel Lys Asp Leu Val Leu Lys 
305 310 315 320 
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Lys Lys Ala Glu Leu Gin Gin His Arg Arg Arg Ala His Leu Val Gly 

325 330 335 

Glu Glu Gly Tyr Ala Glu Glu Phe Ser He Glu AU lie Glu Ala Gly 

340 345 350 

Ala He Asp Pro Ser Leu Yal Leu Glu Gin He Glu Ala His He Ala 

355 360 365 

Thr Val Lys Glu Gin Ala Phe Ser Arg Lys Asp He Leu Glu Lys Val 

370 375 380 

Glu Arg Trp Gin Asn Ala Cys Glu Gin Glu Ala Trp Leu Glu Asp Tyr 
385 390 396 400 

Asn Lys Asp Asp Asn Arg Tyr Asn Ala Gly Arg Gly Ala His Leu Thr 

405 410 415 

Len Lys Arg Ala Glu Lys Ala Arg Thr Len Val Asn Lys He Pro Gly 

420 425 430 

Met Val Asp Val Leu Arg Thr Lys lie Ala Ala Trp Lys Asn Glu Arg 

435 440 446 

Gly Lys Gin Asp Phe Thr Tyr Asp Gly Val Ser Leu Ser Ser Met Leu 

450 456 460 

Asp Glu Tyr Met Phe Val Arg Gin Glu Lys Gin Gin Glu Lys Lys Arg 
465 470 475 480 

Gin Arg Asp Gin Lys Lys Leu Gin Asp Gin Leu Lys Ala Glu Gin Glu 

486 490 495 

Ala Len Tyr Gly Ser Lys Pro Ser Pro Ser Lys Pro Leu Ser Thr Lys 

500 505 510 

Lys Ala Pro Arg His Ser Met Gly Gly Ala Asn Arg Arg Leu Ser Leu 

S15 520 525 

Gly Gly Ala Thr Met Gin Pro Pro Lys Thr Asp He Len His Ser Lys 

530 535 540 

Ser Yal Arg Ala Ala Lys Lys Thr Gin Gin lie Gly Thr Len Ser Pro 
546 550 555 560 

Ser Ser Ser Arg Gly Leu Asp lie Ala Gly Len Pro He Lys Lys Leu 

565 570 575 

Ser Phe Asn Ala Ser Thr Leu Arg Glu Thr Glu Thr Pro Arg Lys Pro 
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580 585 590 

Phe Ala Gin He Thr Pro Gly Asn Ser Val Ser Ser Thr Pro Yal Arg 

695 600 605 

Pro He Thr Asn Asn Thr Gin Asp Asp Glu Asn Arg Thr Pro Lys Thr 

610 ' 61S 620 

Phe Thr Ala Len Asn Pro Lys Thr Pro Met Thr Val Thr Ala Pro Met 
625 630 636 640 

Gin Met Ala Met Thr Pro Ser Len Ala Asn Lys Yal Ser Ala Thr Pro 

645 650 665 

Yal Ser Leu Yal Tyr Asp Lys Pro Gin Val Thr Len Gin Gin Asp He 

660 665 670 

Asp Tyr Ser Phe Glu Gin Arg Arg Leu Ala He Tyr Len Ala Arg Gin 
675 680 685 

Met Yal 
690 



<210> 3 
<211> 4574 
<212> DNA 
<213> Oryza satlva 

<400> 3 

aaaaaaatat ttcaaatcac actacactct 
cctctcctcc gcctctctcg catctgaggc 
gccgccccgt ctcgccctcc ccgctcgacg 
gttcttcgca cctcgcgatg agtagcgcgg 
catgcgattc gcttctactg gagctcaatg 
ccggctactc gtgttggttc tggcatggca 
tcgttctgtt tcaggtgatt tgggatgagg 
tgctgctgga gctcgagcag gagtgcctgg 
accggagccg cgcccagctg cggaaggcca 
tctgctcagc catgggcgag ccgctcgtgc 



ccgtcgtctc ctctcctctc ctctcctccc 60 
tccgatcgcc ggcgacccca gccagaatcc 120 
agaccgcgcc gagcggcgaa gaggcctagt 180 
tgaaggacca gcttcaccag atgtcgacga 240 
tatgtcaccg cttgccgatt caaccatttc 300 
gtggaggatt tacggggttt ttttcttctc 360 
tcggtgagcc cgacacgacg agggacagga 420 
aggtctacag gcggaaggtc gaccaggcga 480 
tcgccgaggg cgaggcagag ctcgccggca 540 
acgttagaca ggttagtttc tggctccacc 600 
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AKUZX 



aatggctgta aaagaggtat cgcaiggttg 
aactgtgcta attggcg&tg gaagaaaagg 
ttctgttgta agatgatagt actactgctt 
tagaactcca tcagcagtta acatttttct 
5 aagggaaagt gt tggcagag ct tacat t tc 
agttactgct caatcggttc tatacttttt 
atatatgttt tgtaaaattt atctgtttcc 
atatttactg ttttaccggc agcatgacta 
gaatacaitc taccttcttg actaatcaat 

10 cagccattca tgcagtaaca tgcatttcat 
atggcttaag agaggagttg aatgcaattg 
aggtcgaacg atggaaccag tttgttcatg 
aaataaggcc agccgatttt gttcccttta 
taagaaagct tgatgagttg acgaaggacc 

15 tcactaatac catctttatc cattttcacc 
aagaatcctt ttcatttctt gtataaaatc 
cagagcgatc ggctaaagca agtgatagaa 
gtgcttggca tagatttcaa gcaaacagta 
gaaggatcaa agaacctgag caacactaca 

20 ctgcgtgaaa tgaagatcca aaggatgcaa 
gtatcaatga acactttcag tctttaactt 
ttgcttctag catgctcgag ctatggaatc 
tgtttcagaa tataacatgc aatattgctg 
ccctctccac agatttcctg aattatgtaa 

25 gttcgtactg tgttatattt tcattaagat 
taggtcgaat ctgaggtgtt aaggcttgaa 
gttttaaaaa agaaagcaga actagaagag 
gaaggttatg cagaggagtt (agcattgaa 
cttactgcct tttattgtgc ctgacaagtc 

30 tgttctgttc gcaggagcta ttgatccctc 
tgcaacagtg aaagaggaag cttttagccg 
gcaaaatgct tgtgaagagg aagcctggct 
tgaagctacg iggtctttgt atatttgttt 



gatcaaaaga tggaagtcga altcctgtgg 660 
aagatttagt agagaactaa aagctacgat 720 
gcattgttga tctgatggag gtaaaccgtg 780 
aactgattag lagtagcgta tcaatatatt 840 
tttctcactt ctattctcga clttatgccc 900 
actgctgttc ccatgcatta gcaatttagg 960 
ttcagtttga atatgttcag catgaataat 1020 
agttactgcc tcaagtacgt tttatttgtt 1080 
tctgcttgac tgtagatttt agcacttcct 1140 
etgaaatttt gcagtcaaat cagaagcltc 1200 
ttccgtattt ggaagaaatg aaaaagaaaa 1260 
tcatagagca gattaagaaa atttcgtctg 1320 
aagttccggt tgatcagtct gacctgtcat 1380 
tggaatccct tcagaaggag aaggtcatca 1440 
agtcatgttg tcatcglgtc tctatctatc 1600 
tcactatgcc atatacatgt ttgtttctca 1560 
catttgaatt ctttgcattc cttatgtgag 1620 
tatgaggtgc accctagctt ggacgaagct 1680 
attgagaggc ttgctgctgc cgcaaacaga 1740 
aaggtcagca ttgcctgtac cattgiagag 1800 
ggttaatctg attctggcag cttcaagatt 1860 
tcatggalac tccacttgaa gagc&gcaga 1920 
cttcagaaca agagataact gaaccaaaca 1980 
tttatcatca ctgagattgc aaaaatttat 2040 
atgaatgttc atcgactala cttataactg 2100 
caactgaaag caagtaagat gaaagatctt 2160 
catagaagac gtgctcatct tgttggcgag 2220 
gctattgaag ctggtaagal actctcctgc 2280 
ataccagaca gagttcatat acctggtctg 2340 
actagtactt gaacaaattg aagctcacat 2400 
gaaggatatt cttgagaaag ttgaaagatg 2460 
ggaagattac aacaaagtat ggatgctagc 2520 
agcaaataat gtggtactga tatctcctgg 2580 
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ctttggcttt ttttaggatg ataatcgtta 
aaagagggct gaaaaggctc gtactiiggt 
atttatgtgt ttggaacttc cttatcaagt 
cattactaca atctgatatc ctgctgattt 
5 aacaaaaatt gctgcatgga aaaatg&acg 
aggttttctt actcttacac attacattga 
cctttcttgc aattcttaca ggttagcctt 
cgtcaggaga aagagcaaga gaagaagaga 
atgtattgtc taaatcatct tttcaccttc 

10 tgcaggatca gaagaagclc caggatcagc 
caaaacccag tccatccaag cccctaagta 
gtgcaaaccg aaggctatct cttggtggag 
tgcattcaaa gtctgttcgt gctgccaaga 
gtaagcccta ctagctatca tgtgtcgata 

15 catatgtcta actcaagcaa acaatatcag 
ctatcaagaa gttgtctttc aatgccagta 
cttttgctca gatcacacca gg&aacagtg 
ataacactga ggatgatgag aacaggactc 
ctccgatgac tgttacggct ccaatgcaga 

20 tticagcaac tccagtttcc cttgtttacg 
tcgactactc ctttgaagaa aggcggctcg 
tgttgatcaa tttatgtacg tagttgaaat 
cgtatgttgg tcaacaatag tcggcctttc 
ttttaatgtt ttctacaacc agtaaaacag 

25 acagctttct cggcagcacg aaacatttct 
atagggatag ttactgttac atatactgta 
ggaggtatta taggaaggat cgttttggaa 
atcagtttct ggattgtccc tcgcaaagaa 
gtcccacctg actccttg&a agtcttctgg 

30 ggcgaacatg atgtgtgtgt cactctcgag 
actccagcaa ccaacaaacg agccagtcat 
ttttccacca ccgtcaagcc tcaagcaaaa 
atgaaatcga gcagctagtt gtgcctgcta 



caatgctggg aggggagcac atctaacact 2640 
caacaagatt cctggtaaig ttactcaatg 2700 
gcatatttaa tttacaailt taactcttgc 2760 
gtgctgagca ggaatggtag atgttttgag 2820 
aggaaaggag gatttcacat atgatggtgt 2880 
tcgggtctat ttttgtttct tgctgaagtg 2940 
tcgtc&atgc ltgatgaata tatgttcgtt 3000 
caaagggt&t latgctctcg cctaatattc 3060 
tgtgaatacg ctctaatact tgaatatacc 3120 
(caaagcgga gcaggaagct ttgtacggat 3180 
caaagaaggc acctaggcac tctatgggtg 3240 
ccaccatgca acccccgaag actgatatac 3300 
aaactgaaga aatcggcact ttgtccccta 3360 
Utttctttt tcctcttatt ttcacttgaa S420 
gtagtagagg lttggacatt gccggattgc 3480 
ctctacgtga gacggagaca cctcgtaaac 3540 
tctcgtcgac gcctgtgcgc cctatcacca 3600 
cgaagacatt tacagcactg aatccca&ga 3660 
tggcaatgac tccctctctg gccaacaagg 3720 
acaagccaga ggtaacattg caggaggaca 3780 
ccatctatct ggccaggcaa atggtttaac 3840 
ctgactgcat tttcttgtcg gtggccattg 3900 
cagtagcact attctgattt actgcaattg 3960 
ctctatacat tagcttgctc actactcagt 4020 
gttctctttg atga&tactt cttgctgtgg 4080 
tgcccttcag aatagaaacc tgttagtacg 4140 
ttttggtggt lagcctgcac agtaagttcc 4200 
aaaagttttc ttgattctgg taattcgttt 4260 
acatgggaag ctatcgtatc gtatcgctcg 4320 
tgagcaggcc accgaaggct gacttgactg 4380 
ttttcacccc gggtttttgt cccaaaacac 4440 
ccaaaacgct acgtaacgcc catcaacacc 4500 
ctggcccccc agtgccctgt accgcccgtt 4660 
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44X1 vr 4. X 



cttctcactc gac'a 



4574 



